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ABSTRACT 

We present an optimised galaxy cluster finder, 3D-Matched-Filter (3D-MF), that 



^ ' utilises galaxy cluster radial profiles, luminosity functions and redshift information 

to detect galaxy clusters in optical surveys. This method is an improvement over 
other matched-filter methods, most notably through implementing redshift slicing of 
the data to significantly reduce line-of-sight projections and related false positives. We 
apply our method to the Canada-France-Hawaii Telescope Legacy Survey Deep fields, 
finding ^170 galaxy clusters per square degree in the 0.2 ^ z ^ 1.0 redshift range. 
Future surveys such as LSST and JDEM can exploit 3D-MF's automated method- 
») . ology to produce complete and reliable galaxy cluster catalogues. We determine the 

reliability and accuracy of the statistical approach of our method through a thorough 
analysis of mock data from the Millennium Simulation. We detect clusters with 100% 
completeness for M 20 o > 3.0 x 1O 14 M , 88% completeness for M 20 o > 1.0 X 1O 14 M , 
and 72% completeness well into the 1O 13 M cluster mass range. We show a 36% multi- 
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pie detection rate for cluster masses 1.5 x 1O 13 M and a 16% false detection rate for 
galaxy clusters > 5 x 1O 13 M , reporting that for clusters with masses <5x 1O 13 M 
false detections may increase up to ~ 24%. Utilising these selection functions we con- 
clude that our galaxy cluster catalogue is the most complete CFHTLS Deep cluster 



C$ ■ catalogue to date. 
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1 INTRODUCTION 

Clusters of galaxies pinpoint the densest regions in the 
Universe. Housed in deep gravitational wells, clusters act 
as laboratories to study the influence of e xtreme environ - 
ments on galaxy formation and evolution (|Dressled 11980). 
The most massive clusters are also natural gravitational 
telescopes, lensing and magnifying li ght from the most 
distant of galaxies (|Stark et al.l [2007). As clusters trace 
the high mass tail of the matter distribution, a complete 
sample can prov i de a very sensitive probe of cosmology 
(|Battve fc Welled 120031 ). Directly probing the growth of 



structure in the early Universe, cluster cosmology is fast 
becoming an important component of future dark energy 
surveys. As the future of optical astronomy ushers in 
large datasets of wide, deep sky coverage, it becomes 
necessary to develop automated algorithms that method- 
ically and accurately search these datasets for galaxy 
clusters. Depending on scientific goals, it is desirable to 
have as complete a galaxy cluster sample as possible, 
over a range of redshifts; any intrinsic limitations of these 
search algorithms and the resultant biases introduced into 
generated galaxy cluster catalogues must be understood. 
The question of how to methodically select and quantify the 
completeness of a cluster sample is the subject of this paper. 
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Many different approaches to searching for galaxy clusters in 
astronomical data exist: current optical astronomy methods 
focus on finding clusters as overdensities via friends-of- 
frien ds algorithms such a s iLi fc Yeel |2008l ). density maps 
(2010), or Vo ronoi tesselation methods 
(120041 ). and Ivan Breukelen fc Clewievl 



as m |Ad ami et al 
as in lLopes et ~ 



(2009) for example. Other search methods look for large, 
red, elliptical cluster galaxy population s, and are re- 
ferred to as red seque n ce techniques llGladders fc Yee 
200C ). ICohn et al l (120071), iKodama et all (120071 ) . iLu et al. 
20091 ) . and iThaniavurl |200£ h for example), perhaps also 
including the existence of bright central galaxies into 
the search algor it hms (such as the maxBCG metho d 
of iKoester et all (|2007l ). and Iffilbert fc Whitel |2009h ). 
Alternatively, other methods search for clusters based on 
characteristic galaxy c l uster lumino s ity and radial profiles 
Post man et al.l (Il996l). lOlsen et al.l (ll999aT). iKepner et al.l 
19991). I White fc Kochanekl ||2002|). iKochanek et al.l J2003h. 



iGilbank et al.l d2004l). iDietrich et al.l (|2007l ). iGroveetal] 



(2009), 



Menanteau et al. 



(2009)). 



Each of these methods 
makes assumptions about general cluster properties and 
resultantly, derived galaxy cluster catalogues will primarily 
include galaxy clusters that reflect those assumptions. 
Ideally a method that minimises these biases and produces 
an understood and complete galaxy cluster catalogue will 
be best suited for statistical science of galaxy clusters and 
investigations into cosmology. 

With this in mind, we present an optimised luminos- 
ity function and radial profile based optical galaxy cluster 
finding algorithm that we call 3D-Matched-Filter (3D-MF). 
We present thorough tests of 3D-MF on simulations, 
and use the ascertained galaxy cluster selection functions 
to produce a galaxy cluster catalogue for the Canada- 
France-Hawaii Telescope Legacy Survey Deep fields. To 
this effect, Section [5] of this paper will discuss the Millen- 
nium Simulation catalogues used, Section [3] will discuss 
matched-filter methodology, as well as 3D-MF itself, and 
Section 0] will outline the selection functions of 3D-MF 
from the Millennium Simulation data (without and with 
photometric redshift error). Subsequently, 3D-MF will be 
run on the Canada-France-Hawaii Telescope Legacy Survey 
Deep Dataset and the resultant galaxy cluster catalogues 
will be discussed in Section [S] 

A ACDM cosmology has been assumed throughout 
this work: H = 73 km/s/Mpc, fi M = 0.25, Q A = 0.75 
(which is also consistent with the Millennium Simulation 
used herein). 



2 MILLENNIUM SIMULATION 
2.1 Catalogues 

The simulation catalogues used thro ughout this work are 
six pe ncil-beam mock catalogues from iKitzbichler fc White] 
(2007) (henceforth KW07). These catalo gues are lightcones 
created from the semi-a nalytic galaxy catalogue from 
|Pe Lucia fc BlaizotJ l|2007h . which in turn is derived from 
the Millen nium Simulation of the concordance ACDM 
cosmogony (|Springel et all l|2005l) : henceforth S05). 



The Millennium Simulation well-reproduces galaxy 
clustering as a function of luminosity as shown in Figure 
5 of S05. Furthermore, semi-analytic modelling applied 
to the original Mil l ennium Simulation catalogues, in the 
|Pe Lucia fc BlaizotJ j|2007h and KW07 catalogues, results 
in luminosity functions in similarly good agreement with 
observations. Consequently, these catalogues are well suited 
to test a cluster finding algorithm based on luminosity, such 
as the 3D-MF galaxy cluster finding algorithm presented in 
this work. Conversely, there are difficulties in reproducing 
observational colours in simul ation work (again, se e Figu re 
5 from S05 and a discussion in lDe Lucia fc Blaizotl l|2007h ). 
This fact hinders the analysis of colour-based cluster finding 
algorithms, such as red s equence bas e d tech niques, on the 
Millennium Simulation. ICohn et alj (120071 ) tested their 
red sequence algorithm to examine changes with redshift 
using the Millennium Simulation. The authors found the 
simulation colours did not match observations; they instead 
determined their red sequenc e by matching the simula tions. 
In another recent example . iHilbert fc White] |20 09 : ) used 
the maxBCG method from lKoester et al.l (|2007T ) (based on 
the red sequence technique) on the Millennium Simulation 
and found that the simulation did not reproduce the colours 
of passively evolving galaxies to the degree required for 
a direct application of colour-redshift relations derived 
from observations. Resultantly, they used altered colour- 
redshift relations, noting that if they had not adjusted 
the colour-redshift selection relations by hand, running 
maxBCG on the Millennium Simulation dataset and subse- 
quent modelling would have found almost no clusters with 
z > 0.25. The 3D-MF technique does not use Millennium 
Simulation colour information, and relies instead on the 
luminosity function of galaxy clusters; therefore, evaluating 
on the Millennium Simulation provides a robust test of the 
application of this method to real data. 

The KW07 mock catalogues used herein consist of six 
deep fields of 1.4 x 1.4 square degrees and mark the first 
time these lightcones have been used in cluster finding work. 
3D-MF uses redshift information: this is not obtained from 
Millennium Simulation colours. Magnitude information is 
used from KW07 but only for the purpose of calculating 
luminosity functions as described. The SDSS magnitudes 
were converted to MegaCam magnitudes (according to 
Equation \ffll ) in anticipation of the Canada-France-Hawaii 
Telescope data that 3D-MF will first have the opportunity 
to examine in Section [S] 

iMegaCsm = «SDSS — 0.085 (rsDSS — «SDSs) (1) 

Hereafter we refer to iMegaCam as i! — band. 

2.2 Millennium Simulation Galaxy Clusters 

A common definition does not exist in the literature with 
which to define a cluster, other than the agreed upon fact 



1 The Canadian Astronomy Data Centre, Herzberg Institute of 
Astrophysics, MegaCam ugriz filter set as per: 
http: / /www2.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/megapipe/ 
docs / filters .html 
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Table 1 . Parameters used to compile a list of known galaxy clus- 
ters from the KW07 mock lightcones of the Millennium Simula- 
tion. 



Millennium Simulation Galaxy Cluster Traits 

• Cluster members have the same friends-of-friends 
identification number (as described in the simulations), 
thus avoiding dark matter substructures 

• Cluster members are flagged in the simulations 
with the M200 of their parent halo 

• Clusters have ^ 5 galaxy members 




Mass of Simulation Galaxy Cluster (M 2 ix>); xlO 10 M Q 



Figure 1. Mass distribution of Millennium Simulation KW07 
galaxy clusters (following the cluster definition given in Table 
[T] and showing the cluster redshift range of 0.2 ^ z ^ 1.0 as 
explained in the text). 

that a cluster consists of multiple galaxies gravitationally 
bound to one another orbiting a common centre of mass. 
Smaller numbers of galaxies bound together are more tradi- 
tionally known as groups, whereas larger galaxy associations 
are typically called clusters; it is the border between groups 
and clusters that is not always strictly defined. For the pur- 
poses of this work, a cluster will be defined in the Millen- 
nium Simulation as a collection of galaxies that belong to 
the same parent halo and have ^ 5 members (where mem- 
bership is defined by examining the Millennium Simulation 
catalogues to z < 1.2); traits of clusters are listed in Table 
\T\ Note that M200 is the mass enclosed within radius r2oo- 
the radius within which the mean density is 200 times the 
critical density at that redshift. Based on this galaxy cluster 
definition, the mass distribution of KW07 galaxy clusters is 
shown in Figure [T] The redshift distribution of these known 
clusters is presented in Figure [5] 

The centre for each known Millennium Simulation cluster is 
defined in this work as the average position of the brightest 
10 (or average of all members when the number of galaxies 
per cluster is < 10) i! — band objects per cluster. The position 
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Figure 2. Redshift distribution of Millennium Simulation KW07 
galaxy clusters (following the cluster definition given in Table [l] 
and noting the catalogue lightcones extend much further than a 
redshift of 1.2: higher ranges are not shown here). 

of the centre is only used when matching the clusters found 
by 3D-MF to those known in the Millennium Simulation to 
determine which clusters were found, and a tolerance range 
is used (described in Section [4]); it is not crucial to immedi- 
ately determine a more exact position of the cluster centre 
and it will be examined further in Section \4. 3. 21 



3 THE 3D-MF ALGORITHM 
3.1 The Foundations 

The 3D-MF galaxy cluster finding algorithm bases its search 
on the luminosity and radial profile of a galaxy cluster, ap- 
propriately sized for the redshift of the cluster. Prior to 
implementing our cha nges (described in S ection 13.30 . this 
method was based on (|Postman et al.lfl996l . hereafter P96), 
and the reader is directed there for a more in-depth dis- 
cussion of the background of this technique. The method 
can use any sensible choice of luminosity or radial profile to 
model a galaxy cluster: 3D-MF follows P96 and currently 
uses a modified Schechter luminosity function (Equation HJ 
and truncated Hubble radial profile (Equation [3]) to describe 
a fiducial galaxy cluster: 

$(M) = 0.4 lnl0<ri0 a4(a+1)(M *- A/) exp [-10°' 4(iu * " M) ] (2) 

where $ is the galaxy luminosity function, <E>* sets 
the overall normalisation of galaxy density, M is absolute 
magnitude, and a is the slope of the faint end of the lumi- 
nosity function. M* is a characteristic absolute magnitude: 
$ drops with increasing luminosity but at magnitudes 
brighter than M* the exponentially decreasing slope of 
the luminosity function cuts off dramatically. The integral 
of the Schechter luminosity function diverges for a < — 1 
at magnitudes fainter than M*; the multiplicative term, 
exp[-10°- 4(M * _M) ], that has been added to modify the 
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Schechter equation, is a dimensionless power-law cutoff 
(weighted by the flux of the galaxy) that allows the function 
to remain integrable. 



detections can be filtered based on their significance, ensur- 
ing detections are well above the background noise, and in 
this way a galaxy cluster list can be generated. 



The radial profile of the cluster is modelled as a pro- 
jected truncated Hubble radial profile; 
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where r is the distance of a galaxy from a cluster centre, 
r c is the cluster core radius, and r co is the cutoff radius. 
r co is arbitrary and must be chosen such that it extends 
well beyond the cluster's core radius, importantly allowing 
non-circularly symmetric clusters to still be detected by 
this technique. 

Once the fiducial cluster has been established by choice 
of luminosity and radial profile, the data is searched 
in one wavelength band for areas that maximally and 
simultaneously match both profiles, thereby filtering 
galaxy clusters out of the data (and thus the terminology 
'matched-filter'). In previous applications of matched-filter 
methodologies, the data have not been binned per redshift 
as photometric redshift information has only recently 
become commonplace. In the absence of photometric 
redshift information in the past, the profiles that were 
being matched were themselves re-determined with an as- 
sumed cluster size and M* value at a series of trial redshifts. 

The likelihood, C, that galaxies within r co match the 
luminosity and radial profile of a fiducial cluster at a 
particular assumed redshift is calculated according to 
Equation [4] 
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D(r, m) d 2 r dm 



(4) 



where m is the apparent magnitude of a galaxy, m* is 
the apparent magnitude corresponding to the characteristic 
luminosity of cluster galaxies, which incidentally includes 
a redshift dependent fc-correction, b(m) is the background 
galaxy count, and D(r,m) is the total number of galaxies 
at a given magnitude and distance from a cluster centre. 

This likelihood is calculated for every possible cluster 
centre in an input catalogue and can be viewed as an array, 
or image map; these maps will be referred to as likelihood 
maps. The likelihood maps are searched for peaks, which 
are galaxy cluster detections. 

The significance of a peak, or galaxy cluster detection, is 
measured according to Equation [S] 



cr = (Sp- S b g)/a bg 



(5) 



where S p is the peak signal, and St g is the background 
signal calculated by binning the distribution of all likeli- 
hood values in a likelihood map, and taking the mode of 
this Gaussian distribution. The background dispersion, a\, g , 
is 0.741 x (Q3 —Qi) where Qi and Q3 are the first and third 
quartiles of the ranked likelihood value distribution. Cluster 



3.2 2D False Detection Rates 

False detections are defined as galaxy clusters found by 
an algorithm that do not exist in reality: an important 
parameter, among others, by which to define the purity of 
an algorithm. Older versions of matched-filter techniques 
have uncomfortably high false detection rates, a feature 
largely attributable to line-of-sight projections. 

P96 test their matched-filter algorithm on a self-created 
simulation (mimicking the Palomar Distant Cluster Survey) 
and find a 65% false detection rate of galaxy clusters 
with peak signal thresholds ^ 3cr. P96 reduce this false 
detection rate to 31% by adding the criteria that clusters 
must be found by their algorithm twice, looking separately 
in 2 colour bands. The ESO Imaging Survey had some 
success in finding clusters using their implementation 
of this same algorithm, but it was initially developed 
with the purpose of preparing a list of candidate clus- 
ters for follow-up observations and not for producing a 
well-de fi ned sample for sta tistical analysis l|Qlsen et al.l 
jl999al ). [ois"en et all jl999bl ); 01&2). These ESO Imaging 
Survey clu ster papers, and sub sequent work with their 
algori th m dDietrich et alj (|2007l ): D07; and lOlsen et af] 
( |2007f ). iGrove et al.l |2009l ): 03&4), have not tested their 
version of the matched-filter algorithm on simulations in 
terms of its ability to recover known clusters, other than 
to very generally quantify the ability of their finder to 
avoid noise detections. Furthermore, these papers aim for 
the same galaxy cluster density as the Palomar Distant 
Cluster Survey in P96 (i.e. 03 quotes ~ 52.1 clusters 
per square degree), and use the same method, implying 
a similar false detection rate to that quoted above (03 
claims 16.9 ±5.4 false detections per square degree, or 33%). 



iKepner et aD l| 19991 ) implement a matched-filter algo- 
rithm based on P96 as well, but do not examine cluster 
recovery numbers. Instead, they focus on the accuracy 
of their algorithm to detect a given cluster at its proper 
redshift, or the accuracy of determining the proper rich- 
ness of a given cluster. White and Kochanek implement 
a matched-filter algorith m based on Kepner's version 
jWhite fc Kochanek! l200l . hereafter WK02) and test on 
self-created simulations to examine their cluster finding 
abilities; however, they find their simulations contain too 
few galaxies compared to real observational surveys: in 
fact, 60% too few galaxies for an i?-band magnitude < 24. 
This limitation affects complications caused by line-of-sight 
projections, as well as luminosity functions integral to 
the matched-filter methodology, and thus efficiency rates 
of their algorithm comp ared to real data ar e reported as 
conservative estimates. IKepner et all j 19991 ) and WK02 
use different likelihood functions from Equation 2] in this 
work, as well as different peak detection methods, and the 
reader is directed to those papers for more details. WK02 
make no adjustments to improve cluster detections at the 
edges of their data. These edge effects become important 
when using cluster finders on real data: masking bad pixels 
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or bright stars is necessary in reality and clusters on the 
borders of the masked regions, if not properly weighted or 
accounted for in terms of the fraction available to the search 
algorithm, will likely be missed. In terms of false detection 
reports, WK02 show how their false detection rates change 
with redshift: for their deepest self-created simulation of 
1.5 x 1.5 square degrees out to redshift 1.0, with a limiting 
R band magnitude of 24, they have false positive rates of 
> 40% (averaging « 60% for all redshift bins) for clusters 
^ 2 x 10 14 M©. 

Proponents of other cluster finding techniques, such 
as red sequence based methods, often comment upon 
older matched-filter methods' high false detection rates 

as a reas on to avoid t his te chnique l|Gladders fc Yed 

(|2000h or iGilbank et all \2G04 ) for example). However, 
the false detection rates quoted above are largely caused 
by line-of-sight projections in the data; removing this 
contamination will lead to much improved false detection 
rates and push down the galaxy cluster mass limit to which 
reliable detections can be made. Arguably, red sequence 
based methods have their own line-of-sight projection 
issues. iGladders fe Yeel (2000) for example introduced their 
red sequence based cluster finder by testing it on known 
CNOC2 clusters (i.e. not simulations, but real data), and 
found excellent false positive rates of < 5% for clusters at 
redshifts < 0.5. However, a s WK02 also show, false positives 
increase with redshift. The lGladders fc Yed (|200Ct ) CNOC2 
sample testing provides limitations in the higher redshift 
ranges: notably a 36% false detection rate in the redshift 
range 0.5 < z < 0.6 (where authors suggest spectroscopic 
data was inco mplete and did not d educe anything further). 
Furthermore, iKoester et all (|2007l ). using their maxBCG 
method (based on the red sequence method), find 16% of 
found galaxy clusters with shallow redshifts (0.1 < z < 0.3) 
are in reality line-of-sight projections (as tested on the 
observational dataset of the SDSS). As all optical cluster 
finders have false positive rates that will increase with 
redshift, it becomes a goal of this work to minimise these 
false detections as much as possible. 

Intrinsic biases will exist to some degree in all meth- 
ods. For example, red sequence based methods have their 
own intrinsic biases about galaxy clusters, notably assum- 
ing all galaxy clusters have large enough red sequence 
populations to be detected based on red sequence alone. 
Complimentary galaxy cluster finding techniques with 
different biases are therefore necessary, first to determine 
whether clusters can be found that are missed by other 
techniques, and second to be able to ultimately construct 
a galaxy cluster list from observational data that is as 
complete as possible, and whose biases are well-understood 
(via testing on simulations). 

3.3 3D-MF's Advancements 

Using the foundations of the method discussed above in Sec- 
tion 13.11 3D-MF advances the matched-filter technique by 
incorporating photometric redshift information, and adjusts 
additional elements as described below, to update and im- 
prove galaxy cluster finding. With the requirement that all 
next-generation surveys for cosmology come with photomet- 



ric redshifts, it is natural for 3D-MF to extend the matched- 
filter technique into this third dimension. 

3.3.1 Optimal Use of Photometric Redshifts 

3D-MF cuts the input data by redshift, initially into non- 
overlapping redshift slices. Each slice is then matched 
against appropriate luminosity and radial filters scaled for 
that particular redshift slice. The resultant likelihood maps 
(described in Section ^. ll with Equationf?]) are run individu- 
ally through a new 3D-MF peak detection pipeline to iden- 
tify galaxy clusters. Upon completion of a range of sequential 
redshift slices the entire process is repeated with a shift in 
the slices (thus overlapping the redshift slicing with the ini- 
tial run) in order to account for clusters or structures that 
exist on borders of the original slicing. For example, an input 
catalogue could be sliced from redshift 0.00 to 1.20 (with a 
bin width of 0.2, but any desired bin width could be cho- 
sen) and these would be considered the original slices. After 
running 3D-MF on these original slices, the process is re- 
peated by slicing the whole input catalogue differently, from 
redshift 0.10 to 1.30 (with the same bin width for exam- 
ple), and these would be considered the shifted slices. The 
resulting galaxy cluster lists from a complete run of original 
slices and a complete run of shifted slices are then merged 
(the details of which are presented in Section [4} . Clusters 
that are multiply detected can be tracked, and false detec- 
tions in either the original or shifted slices can be lowered 
by associating them properly with clusters from the shifted 
or original slices respectively. The aim of 3D-MF is to opti- 
mise cluster detections through redshift slicing, thus elimi- 
nating line-of-sight projections, but also to take into account 
the fact that each cluster spans a range of space and subse- 
quently redshift slicing has to be applied carefully to not cut 
these clusters and count them multiple times, or conversely 
to miss them entirely. In optimising 3D-MF a variety of slic- 
ing styles were examined (various slice widths, non-uniform 
slice widths, differing overlapping slice amounts, etc), and 
the slicing method that produced the best results (as seen 
in Section l4.2.1[l used redshift slice widths of 0.2, with shifted 
slices overlapping original slices by a redshift of 0.1. 

3.3.2 Masks 

3D-MF accounts for masked regions of data (obscured by 
bad pixels, star diffraction spikes, or data regions of low 
signal-to-noise for example) by rejecting potential cluster 
candidates that are masked above a set percent (given as 
an input threshold, chosen here to be 50%). In cases where 
an important fraction of a cluster's luminosity function is 
missing due to a mask, it simply cannot be rebuilt, and 
missing optical information such as this is a limitation 
of optical galaxy cluster finding algorithms, including 
matched-filter methods. 

Partially masked clusters are weighted by 3D-MF ac- 
cording to the fraction of the cluster that is masked 
(Equation [5} ; 

*-Mask weighted — £ (1 + %masked/100) (6) 

where the likelihood, £ (from Equation [4} , is up- weighted 
by the percentage of the redshift-scaled radial filter that is 
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masked, giving £ Ma sk 



weighted • 



The mask- weighted likelihood maps, /3]v[ask weighted , in- 
crease the likelihood that a partially masked cluster 
matches the fiducial cluster at a given redshift, as its lumi- 
nosity and radial profiles are up-weighted to compensate for 
the information missing due to the presence of the mask. 
Importantly, mask-weighting also improves detections of 
clusters near the edges of an image as clusters previously 
missed in these regions were found after the addition of this 
criteria. 



3.3.3 Background Galaxy Counts 

The requirement for an accurate measure of the background 
galaxy counts (b(m) in Equation 2]) is an issue in P96 
and other implementations of matched-filter such as Ol- 
4 discussed in Section 13.21 These older implementations of 
matched-filter methodology assume a priori that b(m) can 
be modelled by a power law in m, or they determine b(m) 
by creatin g self-simulated backgrou nds (following the pre- 
scription of lSoneira fc Pee bles 1978). 3D-MF determines the 
background counts by binning the magnitude distribution 
of the input data and subsequently interpolates within this 
measured distribution to determine b(m) for each galaxy (i.e. 
pulling out an interpolated background number count for a 
particular galaxy magnitude), including it as a data-driven 
b(m) in the integral over cluster galaxies in Equation 2] We 
find that this accurately models the likelihood of a cluster 
signal against a true data-derived background and increases 
a cluster's likelihood of detection with the advantage that 
our method does not rely on biased modelling of b(m). 



3.3.4 Peak Detection 

Previous matched-filter methods (implementations of P96 
such as D07 and 01-4) detect peaks in the likelihood maps 
using SEXTRACTOR; while SEXTRACTOR is optimised for 
the detection of isolated peaks, it tends to blend things. 
Although this is excellent for object detection, the likeli- 
hood maps created with a matched-filter algorithm are more 
akin to continuous maps and thus need an appropriate peak 
detection method. Furthermore, SEXTRACTOR is parame- 
ter dependent; we desire a robust method that withstands 
reasonable changes in parameter space. 3D-MF detects all 
peaks above a user-input threshold (such as 3.5a) and 
thereby generates a complete list of possible cluster candi- 
dates. 



3.3.5 Assigned Cluster Redshift 

Other versions of the matched-filter technique assign a 
redshift to a galaxy cluster where it is maximally detected 
(i.e. the redshift at which the fiducial cluster is maximally 
matched) but in practise this often assigns an incorrect 
redshift to the cluster because all data is considered at 
all redshifts. Several other factors also contribute to this 
incorrect redshift assignment, among them the assumed 
background counts discussed above, the fc-correction (and 
the effect on this of the assumed evolution of various 
galaxy types), and the fact that the cluster signals become 



over-corrected via the normalisation of the luminosity filter 
(as in D07 and 01-4). These methods apply a cluster signal 
correction factor to counteract some of these points, but 
resulting redshift assignments are still, on average, incorrect 
and thus are not robust (nor do they claim to be; see 01&2). 

A plethora of papers followed up on the clusters 
found in 01&2 with the goal of obtaining spectro- 



jRamella et al. (1200(f), Hansen et al.l ( 


2002), iBenoist et al. 


l2002h. Olsen et al. J2003l).l01sen et al. 


(2005l)a. Olsen et al. 



(|2005r) b. iGrove et all (|2008|)). Not all clusters were con- 
clusively found in the spectroscopic analysis; of the ones 
that were, the spectroscopic redshifts for these systems 
were compared with their matched-filter found redshifts. 
Unfortunately, number counts are too low to ascertain 
anything statistically (i.e. the sample size available to 
compare spectroscopic and photometric redshifts was often 
only a few galaxy clusters per paper). 

Also investigating redshift comparisons, galaxy clus- 
ters in D07 have redshifts derived from their matched-filter 
implementation and a sample of these are compared to 
known spectroscopic redshifts from the literature. These 
authors find their matched- filter has a bias toward lower 
redshifts and conclude they are underestimating the true 
redshift of galaxy clusters; a discrepancy that increases 
with higher redshifts. 

As 3D-MF requires photometric redshift information 
for all of the objects in a catalogue and slices the catalogue 
according to redshift, once each galaxy cluster is found, 
its associated redshift slice is automatically, immediately 
and more accurately known. For example, for a redshift bin 
width of 0.2, with overlapping bins shifted by a redshift 
of 0.1 from the original bins, consider a cluster that is 
first detected in an original bin: the cluster is assigned the 
redshift of the centre of the bin and this cluster centre is 
thereby known to within ±0.1 in redshift (i.e. the width 
of the bin). After repeating the cluster finding process 
with shifted slices (as described above in Section I3.3.ip . 
this cluster may also be detected in an overlapping shifted 
slice, localising the redshift of its centre even further (to 
roughly ±0.05 in redshift: the width of the overlap region 
of the original and shifted slices). This of course assumes 
that photometric redshifts are without error and unbiased. 
Knowing a cluster's galaxy members (and their redshifts) 
leads to an even more localised determination of a cluster's 
redshift; this will be discussed in relation to 3D-MF in 
Section S3U 

3.3.6 Parameter Optimisation 

3D-MF has a variety of parameters that can be optimised for 
maximal galaxy cluster detection. Using Millennium Simu- 
lation data to test 3D-MF has proven irreplaceable in its 
contribution to improving 3D-MF, in particular determining 
optimal parameter settings to apply to real data, assuming a 
similar cosmology. A discussion of these parameter settings 
follows in Section 0] 
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Table 2. 3D-MF basic run-time parameters (consult the text for 
descriptions of each parameter). 

3D-MF Input Parameters 



Schechter Function (Equation [2]l 



V — band 


-20.69 


a 


-1.11 


Hubble Profile (Equation 


r c 


O.lMpc 


Tco 


IMpc 


Likelihood Map (Equation 3} 


Scale 


0.124-U 






Peak Detection 


Mask Weighting 


C (1 + %masked/100) 


Cluster Rejection 


if ^ 50% masked, reject 


Minimum Significance 


> 3.5cr 


(Equation [5j for Real 




Detection 




Multiple Detections 


2D Grouping (RA,Dec) 


1.5Mpc diameter 


in Same Redshift Slice 




Grouping Between 


^3D-MF ± 0.1 


Redshift Slices 





4 3D-MF ON MILLENNIUM SIMULATION 
MOCK LIGHTCONES 

In this section we present the application of the 3D-MF al- 
gorithm to the Millennium Simulation KW07 mock cata- 
logues. We first assume exact redshifts are known for all 
galaxies, and then investigate the impact of adding photo- 
metric redshift errors to the data, on 3D-MF's selection func- 
tions. When matching the 3D-MF found clusters to known 
mock clusters, we will use the known clusters above a mass of 
1.5 x 10 13 M Q , and focus on the redshift range 0.2 z < 1.0 
for reasons that will be explained. 

4.1 3D-MF Parameters 

The Millennium Simulation mock lightcone catalogues 
were cut at an i'— band magnitude of 25.5 to simulate a 
magnitude-limited survey. The catalogues were then cut 
into optimal redshift slices (see Section [4727T]) : redshift slice 
widths of 0.2 and shifted slices shifted from original slices by 
0.1 in redshift (i.e. original and shifted slices overlap by 0.1 
in redshift). The mock catalogues were then analysed with 
3D-MF, as described in Section [3] using the parameters 
listed in tabled 

M* and a (Equation[2]) were obtained by fitting a Schechter 
luminosity function to mock galaxies and found to be 
M *'-band = -20.69 ± 0.24 and a = -1.114 ± 0.028. We 
optimise 3D-MF to find clusters with masses > 1O 14 M0 
by setting the cutoff radius, r co , to be IMpc based on 
the r2oo radius of a 3 x 1O 14 M0 cluster. This is suffi- 
ciently large to enclose non-symmetrically shaped clusters 



and thus allow them to be detected, but still small enough to 
maximally prevent neighbouring or on-the-verge-of-merging 
galaxy clusters from becoming blended into one cluster de- 
tection. P96 explored a range of cutoff radii and found the 
significance of their cluster detections dropped by up to 40% 
when increasing r co beyond IMpc, but didn't test decreas- 
ing this window size significantly within IMpc. However, 
P96 notes that when cluster dimensions are significantly 
larger then the cutoff radius, the signal will obviously be 
truncated, though the degree depends on cluster shape. In 
the Millennium Simulation analysis that follows we find our 
choice of r co is very successful at detecting clusters of masses 
> 1 x 1O 14 M0 and has a very useful by-product of also de- 
tecting clusters with masses as low as ~ 1O 13 M0. As the set 
cutoff radius for these low mass clusters is significantly larger 
than their extent, it will be difficult to interpret our results 
for these objects as we discuss further in Section [473] Future 
implementations of 3D-MF will experiment with optimising 
low mass detections using a varying cutoff radius. 

4-1.1 Single Galaxy Cluster Detection Criteria 

Following 3D-MF's search of the Millennium Simulation 
mock lightcone catalogues, the output detected galaxy clus- 
ter list was compared with the known Millennium Simula- 
tion galaxy clusters (see Section 12. 2[) . To match a galaxy 
cluster detection with known galaxy clusters, a cut was 
first made in a projected two-dimensional radius, set at 
0.044 degrees. This value was chosen due to the fact that 
0.044 degrees at a mid-redshift range, z ~ 0.55, corre- 
sponds to ~lMpc, the assumed cluster cutoff radius (r co ) 
for an individual cluster as per Table [2] and notably at the 
higher redshift end, z — 0.95, 0.044 degrees corresponds 
to ~1.25Mpc remaining a reasonable choice there as well. 
Note that sizing this tolerance per redshift did not signifi- 
cantly change results. Therefore, any galaxy cluster detec- 
tions within 0.044 degrees of a known Millennium Simula- 
tion cluster were considered possible candidates. From the 
list of candidates, each 3D-MF found cluster was matched 
in 3D space (RA, Dec, z) to the closest known cluster and 
that known cluster is thus considered detected. The redshift 
component of this matching was performed within ranges 
of Z3d-mf ±0.1 for redshifts > 0.6 (where Z3d-mf de- 
notes the 3D-MF derived cluster redshift, and ±0.1 indi- 
cates that this matching was done within a redshift slice), 
and 23d-mf ± 0.1 ± 10%(of slice width) for z $J 0.6. Note 
that lower redshift clusters will be spread out in redshift bins 
more appreciably than higher redshift clusters in relation to 
the bin volume; we wanted to avoid missing proper matches 
between 3D-MF clusters and known mock clusters seemingly 
spread out due to this effect and thus a slight widening of 
the matching redshift parameter for lower redshift clusters 
was chosen. 

4-1.2 Multiple Galaxy Cluster Detection Criteria 

As per the discussion in Section 12.21 it can easily become 
complicated to define a galaxy cluster. For example, in 
some cases it can become difficult to realistically determine 
whether a multiply detected galaxy cluster is either a) 
one larger known galaxy cluster detected multiple times 
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(for instance consider a merging system in a slightly 
'dumbbell' shape that still has two clear lobes from the 
individual galaxy clusters, each lobe separate and large 
enough to be detected individually despite the chosen 
r co value), or b) from one detection of a cluster and a 
second detection is instead from smaller known nearby 
clusters in the lower mass ranges. For this reason, we chose 
to track multiple detections in three-dimensional space, 
and apply grouping criteria. Grouping 3D-MF detections 
to 3D-MF detections by a physically motivated linking 
length, and calling these linkages multiple detections, we 
can track multiple detections within each redshift slice, as 
well as between redshift slices. We wanted to avoid falsely 
assigning detected clusters to the wrong known galaxy 
clusters and to track multiply detected clusters at all mass 
ranges separately from false detections. Resultantly, any 
3D-MF detections within a predetermined linking length of 
each other were assigned a similar grouping identification 
number. This linking length was chosen to be 0.75Mpc 
(in two dimensions: RA and Dec) based on the physically 
motivated assumption that if one 3D-MF detection could 
be exactly at the correct known centre position, and noting 
that clusters typically span 1.5Mpc in diameter, this would 
put a detection at 0.75Mpc away from the correct centre 
as a maximal radius by which to associate detections. The 
third dimension of linking length, redshift, was selected 
to be Z3D-MF ±0.1 (i.e. joint membership in an original 
slice and an overlapping shifted slice) and was not chosen 
to be 0.75Mpc as in the two-dimensional linking, simply 
because the res olution in redshift s p ace i s unfortunately 
not this high. IWhite fc Kochanekl l|2002l ) implement a 
similar method and drop cluster candidates when a more 
likely cluster detection exists within a projected separation 
of l/i _1 Mpc and redshift difference of ±0.05. 3D-MF's 
multiple detection grouping criteria are summarised in 
Table [2] It is important to note that 3D-MF detections 
were associated to each other and the known cluster centres 
were not used in this step, because we wanted to develop a 
method that would transfer to real data where centres are 
always unknown. 

The best match in a multiple detection grouping is 
chosen to be the one that is closest to a known simulation 
cluster centre. We point out to the reader that this choice 
is a difficult one, as there are known issues regarding 
centroiding; how one decides the true centre of a cluster is 
an interesting science topic many papers devote themselves 
to in their entirety. We will provide evidence in Section 
14.3.21 that serves to support our choice of best match, and 
plan to investigate multiple matches and centroiding in 
future work. Our selection function plots are free of multiple 
detections not considered best matches as we believe these 
are duplicate detections of single clusters. 



4-1.3 False Detection Criteria 

False detections were those 3D-MF cluster detections that 
did not match to any known galaxy clusters after the above 
matching and grouping criteria were applied. As described, 
the cluster finding process is repeated with original and 
shifted slices. In order to ensure each false detection was 
not a 'faint' detection of a real structure (for example, to de- 



termine whether a false detection was actually found more 
significantly in a different redshift slice, perhaps due to a 
cutting of the cluster), and to merge original and shifted 
slices, the false detections from the original slices were cross 
referenced in RA, Dec, and redshift with real cluster detec- 
tions from overlapping shifted slices (and vice versa). The 
remaining false detections will be quantified and are likely 
due to noise fluctuations or subsisting projection effects. 

4.2 Modelling Photometric Redshift Errors 

Photometric redshift errors were modelled for this work 
by examining the area of overlap between the Canada- 
France-Hawaii Telescope Legacy Survey (CFHTLS) Wide 
field data and the V IMOS VLT Deep Survey (VVDS; 
iLe Fevre et alJ (120051')') an d the DEEP2 spectroscopic 
survey (|Davis et all l|2007h ). Photometric redshifts were 



estimated for the CFHTLS Wide data from ima ges provided 



by th e CFHTLS- Archive-Research Su rvey IIErben et al 



(2009)) with the method described in iHildebrandt et al 



(2009), and those photometric redshifts were then compared 
to secure spectroscopic redshifts. This process yields an 
accuracy of the photometric redshifts for galaxies with 
magnitudes i' < 24 of CAwn+z) ~ 0.047 after rejection of 
2.8% of outliers. 

We confine the redshift range of galaxies to z ^ 1.1 
as that is the region where our photometric redshifts are 
most accurate and not badly affected by biases. Common 
objects in the overlap regions of both CFHTLS and VVDS 
and CFHTLS and DEEP2 were binned according to 
magnitude and redshift and the photometric redshift error 
(i.e. the rms of (2 p hotomctric — Spectroscopic) after rejection of 
outliers) for each bin was calculated and weighted by the bin 
number counts. A three-dimensional surface (magnitude, 
photometric redshift, and photometric redshift rms) was 
then found to be best fit with a 2 degree polynomial: the 
rms error of any new magnitude and redshift can then be 
extrapolated from this model (including an extrapolation 
of errors for magnitudes such as i! > 24 where deeper 
spectroscopic redshift catalogues do not currently exist 
to allow more accurate modelling of this high magnitude 
region). Using the Millennium Simulation mock lightcone 
data, a galaxy's magnitude and exact redshift was used 
in conjunction with the aforementioned rms model to 
calculate a typical observational photometric redshift error 
rms for that particular galaxy, from which a Gaussian 
distribution was created. The assumption of a Gaussian 
error distribution is supported by the comparisons between 
photometric redshifts and spectroscopic redshifts mentioned 
above (no significant secondary peak or skewness was found 
as confirmed by the low number of outliers stated above and 
an overall vanishing photo-z bias respectively). Randomly 
assigning a photometric redshift error to a given galaxy 
was then accomplished by randomly drawing an error value 
from the Gaussian distribution with the rms modelled for 
that galaxy. The redshift distribution of mock lightcone 
galaxies before and after the redshift errors were applied 
are shown in Figure [3] (where the data have also been cut 
at a limiting i! — band magnitude of 25.5 to mimic real 
observational data, and are only shown out to a redshift 
of 1.1 although, as mentioned, the lightcones themselves 
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0.2 0.4 0.6 0.8 1.0 1.2 

Redshift of Simulotion Galaxy Cluster 

Figure 3. Redshift distribution of the 6 Millennium Simulation 
mock lightcones (KW07) as exact redshifts (solid histograms) and 
with r edshift errors mode l led fr om the photometric redshift er- 
rors of iHilde brandt ct al. (2009) (blue, dashed histograms). Er- 
rors were derived from comparisons between the CFHTLS with 
VVDS and DEEP2 overlap regions; an explanation of the error 
modelling can be found in the text (Section 14.21 1. The data have 
been cut at a limiting i'— band magnitude of 25.5 to mimic real, 
observationally limited data, and are only shown out to a redshift 
of 1.1 as explained in the text. 



are much deeper). The redshift error scales with redshift; 
selecting redshift slices of 0.2 in width ensures our redshift 
slicing is larger than the rms of ~ 0.1 at a redshift of 1.1. 

4-2.1 Fine-Tuning 3D-MF Parameters 

Investigating the use of various redshift slice widths with 
3D-MF, as well as examining the effects of introducing 
redshift errors into the data, and the corresponding recovery 
rates of galaxy clusters, leads us to be able to fine-tune 
3D-MF's parameters and select an optimal redshift slicing. 

An examination of the completeness (the fraction of 
known clusters recovered by 3D-MF) as a function of mass 
and redshift slice width is essential to recover as many 
clusters as possible. Figure [4^ presents a few variations 
of redshift slice width as evidence that an analysis of the 
mock lightcones with 3D-MF set to use redshift slice bin 
widths of 0.2 in size recovers the most known clusters 
with Af2oo ^ 5 x 10 13 A/q. Repeating this analysis with 
photometric redshift errors shows that completeness tends 
to go up in almost all cases when running 3D-MF separately 
with various redshift slice widths (see Figure |4p) but the 
significance of detections against the background goes down 
(as will be shown). 

Maintaining a constant redshift slice width, it is of further 
interest to note the effect on completeness when the limiting 
magnitude of the survey is changed. Figure [5] presents a 
constant redshift slice width of 0.2 with varying survey 
depths of 23.5 to 25.5 il — band magnitudes. As expected, 
the ability to find all clusters is significantly reduced as 




1x10* 2x10* 
Mass of Simulation Galaxy Cluster (M !00 ); x10'° MO 




1x10* 2x10* 

Mass of Simulation Galaxy Cluster (M !0D ); x10'° MO 



3x10* 



Figure 4. Completeness with respect to mass, measured as a 
fraction of known clusters found by 3D-MF. The input data was 
sliced into overlapping redshift slices as explained in the text. 
Overlapping slices are in each case shifted by half of the redshift 
slice width. Figure [4^, presents exact redshifts while Figure |4p 
includes redshift errors in the data. 




ting magnitude 25.5 
ting magnitude 24.5 
ting magnitude 23.5 



1x10* 2x10* 
Mass of Simulation Galaxy Cluster (M !00 ); xl0'° M0 



Figure 5. Similar to Figure |4p (completeness with respect to 
mass, measured as a fraction of known clusters found by 3D-MF), 
but here the width of the redshift slices is held fixed and the effect 
(on the fraction of galaxy clusters found) caused by changing the 
i'— band limiting magnitude of the survey is shown. 
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Figure 6. Comparing 3D-MF's real, true galaxy cluster detec- 
tions with false detections as a function of detection significance 
(down-sloping lines represent no redshift errors, while up-sloping 
lines represent data with redshift errors). 



the survey depth decreases: fewer galaxies are present in 
shallower data making their parent clusters improbable or 
impossible to optically detect. Interpretations associated 
with cluster masses below ~ 5 x 10 13 Mq will be addressed 
throughout Section T4.3I and the complications in this region 
will be addressed. 

Returning to a discussion of significance levels, ignor- 
ing any detections below the 3.5<r level is a sensible 
decision: the probability of a 3a positive curvature peak 
from a background G aussian field, due to chance, is 4% (see 
IVan Waerb ekc (2000)). Effectively, desiring a contamination 
due to random fluctuations of less than 1%, we have to be 
more conservative and accept only those cluster detections 
above 3.5a. Figure [6] investigates significance and clearly 
shows false detections were not found at a high level of 
significance when compared to known clusters accurately 
detected by 3D-MF. Low false detection significances are 
encouragingly still seen in Figure [6] with the introduction 
of photometric redshift errors (and Figure [6] shows, as 
expected, redshift errors reduce detection significances in 
general). If we cut at a higher significance level, such as 
6.5a, we could remove > 95% of our false detections, but 
we would also lose many lower significance, real detections; 
a trade-off must be reached and thus 3.5a was chosen. 



4.3 Final Selection Functions 

Having determined appropriate parameter settings, we now 
present 3D-MF's selection functions. The cumulative mass 
function of galaxy clusters recovered by 3D-MF is presented 
in Figure[7] An examination of this figure shows the recovery 
rate of known clusters to be very accurate above an M200 of 
~ 5 x 10 13 Mq; results that are encouraging for 3D-MF, and 
furthermore are consistent for all redshifts (see Figure [8}. 

Taking the optimal redshift bin width of 0.2 (as per 
Section I4.2.1|l and examining completeness with respect 
to redshift results in fractional recovery rates of known 
clusters per redshift that are constant in each mass range, 
with excellent recovery (i.e. > 88%) for clusters with 
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Figure 7. Cumulative mass function for galaxy clusters from the 
Millennium Simulation KW07 catalogue (as per Tabled] redshift 
errors described in the text). 
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Figure 8. Completeness with respect to redshift, measured as a 
fraction of known clusters found. 



~£ 1.0 x 10 1 A/q. Completeness rates per mass, 
multiple detection rates, and false detection information 
are summarised in Table [3] 3D-MF has a 100% recovery 
for clusters with Af 20 o > 3.0 x 10 14 A'/©, 97% of clusters are 
found with masses above 2.5 x 10 14 Mq, 88% recovery rates 
are seen for M200 > 1.0 x 10 14 M© as mentioned, and 72% 
of clusters are found in even lower mass ranges (numbers 
quoted pertain to the presence of photometric redshift 



errors) . 



For clusters with redshifts in the range 0.2 ^ z ^ 1.0 
and A/200 ^ 1.0 x 1O 14 M0, the galaxy cluster number 
density in the Millennium Simulation KW07 catalogues is 
~ 18 galaxy clusters per square degree; 3D-MF finds ~ 17 
galaxy clusters per square degree (using exact redshifts) and 
~ 16 galaxy clusters per square degree (using redshifts with 
errors) for this same redshift and mass range. Furthermore, 
for clusters with 1.5 x 1O 13 M M 20 o < 1.0 x 10 14 M Q , 
the galaxy cluster number density in the Millennium 
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Table 3. Completeness (percent of total known clusters found by 3D-MF) and multiple and false detection rates for 3D-MF on KW07 
simulation galaxy clusters (photometric redshift errors described in the text). The Detected Galaxy Clusters section reports the number 
of clusters detected at least one time, the Multiple Detections section reports additional detections, and the False Detections section 
presents the fraction of total 3D-MF detections not matched to known clusters above 1.5 X 10 13 Mq. 



Detected Galaxy Clusters 



Mass Range 


Number of 


No photometric redshift errors 


With photometric redshift errors 




Known Clusters 


Number 


Completeness 


Number 


Completeness 


^ 3.0 X 10 i4 M Q 


16 


16 


100.0% 


16 


100.0% 


S> 2.5 x 1O 14 M 


32 


32 


100.0% 


31 


96.9% 


> 1.0 x 1O 14 M 


208 


197 


94.7% 


184 


88.5% 


> 5.0 x 10 13 M@ 


637 


548 


86.0% 


456 


71.6% 


Ss 1.5 x 10 13 M Q 


3240 


1775 


54.8% 


1358 


41.9% 



Additional Multiple Detections 







Percent of total 




Percent of total 






Cluster Detections 




Cluster Detections 


> 3.0 x 1O 14 M 


14 


0.4% 


21 


0.7% 


^ 2.5 x 1O 14 M 


28 


0.8% 


39 


1.4% 


> 1.0 x 1O 14 M 


208 


6.1% 


227 


8.1% 


> 5.0 x 1O 13 M 


504 


14.8% 


439 


15.6% 


^ 1.5 x 1O 13 M 


1253 


36.7% 


1020 


36.2% 


False Detections 


> 1.5 x 1O 13 M 


383 


11.2% 


438 


15.6% 



Simulation KW07 catalogues is ~ 258 galaxy clusters per 
square degree; using exact redshifts, 3D-MF finds ~ 134 
galaxy clusters per square degree, and when photometric 
redshift errors are added, 3D-MF finds a cluster number 
density of ~ 100 galaxy clusters per square degree. 

The multiple detection rate, measured as the fraction 
of total 3D-MF detections matched to known clusters that 
are already matched to a 3D-MF detection, for clusters with 
M 20 o ^ 1-5 x 1O 13 M , is 36.7% when exact redshifts are 
used and 36.2% when redshift errors are added to the data. 
This multiple detection rate is what would be expected 
considering our redshift slices overlap by 0.1 in redshift: 
many clusters are included wholly in both original and 
shifted slices and, depending on their simulation redshift, 
are found by either preferentially matching our fiducial 
cluster (recall Section 13. ip in the original or shifted slices, 
or both. 

False detection rates are reported as the fraction of 
total 3D-MF detections not matched to known clusters 
above 1.5 x lO 13 Af ; when simulations have photometric 
redshift errors added, the false detection rate increases a 
mere 4.4% from the non-error photometric redshift case to 
15.6% false detections overall. Since photometric redshift 
errors scatter galaxies to random redshift slices they are not 
expected to induce increased clustering. False detections 
can be further analysed; they are shown in Figure [9] to be 
a fairly uniform function of redshift, with the exception of 
the redshifts at the edges of our redshift range (z — 0.2 
and 1.0). It is likely that some of these edge redshift false 
detections are in fact true detections of simulation clusters 
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Figure 9. 3D-MF false detections, as a fraction of total 3D-MF 
detections, shown as a function of redshift. 

with redshifts < 0.2 or > 1.0. This needs to be investigated 
further, but note that regardless of the redshift range 
chosen, there will always be potential contamination from 
galaxy members of simulation clusters whose centres lie 
outside the range of interest but whose presence is enough 
to cause a 3D-MF detection. Recall that we had reasons to 
select this redshift range: introducing redshift errors into 
the simulations was confined to a redshift region where 
photometric redshift errors could be accurately modelled 
(see Section |4~2|) . 

As previously discussed, 3D-MF measures the significance 
of each detection above the background galaxy distribution 



12 Milkeraitis et al. 




a of Galaxy Cluster Detection 

Figure 10. Significance of galaxy cluster detections in all 6 sim- 
ulation lightcones. Only detections ^ 3.5o- are considered ac- 
tual real detections. Known clusters are not considered below 
1.5 X 10 13 Mq (a sensible mass cut, well below the divergence 
from expectation as shown in Figure [3 . The redshift bins in the 
plot do not reflect redshift slicing, but rather are found clusters 
binned by their redshifts. 



(Equation 0). False detections as a function of the detection 
significance are presented in Figure [5] with the significance 
levels at which known clusters were accurately detected: 
many more highly significant clusters were found which were 
real detections, rather than false detections. Introducing red- 
shift errors does lower the significance of detections, but as 
Figure 2] shows, completeness remains high. 

4.3.1 Does 3D-MF Have a Proxy for Mass? 

The significance of cluster detections 3.5a) as a function 
of cluster mass is presented in Figure 1101 Focusing on 
clusters with masses > 10 14 Mq we see higher mass clusters 
being detected with higher significance as one might expect. 
For lower mass clusters we find the detection significance 
is not correlated with mass and we believe this is due 
to a number of reasons. We looked at the few cases of 
> 10(J detections matched with known clusters of masses 
< 2 x 10 13 Mq and found that while the closest known 
cluster to the detection was a low mass cluster, in the 
majority of cases galaxy interlopers from a nearby more 
massive, > 1O 14 M0, cluster were more likely responsible for 
the significance of the detection. In addition, as discussed 
in Section \4. II 3D-MF is not fine-tuned for interpreting the 
detection of low mass clusters as the radial filter is currently 
much larger than the extent of these low mass clusters. 
3D-MF assigns all galaxies within a IMpc radius around a 
peak detection to a cluster; for low mass clusters, galaxies 
within this radius and within a redshift slice, but outside 
the low mass cluster halo, will add noise to a measurement 
of significance. This is demonstrated in Figure [11] which 
presents the number of galaxies found by 3D-MF per known 
cluster as a function of cluster mass. 



The left panel of Figure \TT\ (Figure lllb ) shows the 
results for exact redshifts and the right panel (Figure lllb ) 
shows the results when photometric redshift errors are 
included. For clusters with M200 > 5 x 1O 13 M , 3D-MF 
correctly measures the number of galaxies belonging to 
a cluster. However, for lower mass clusters we see the 
measured galaxy cluster member counts unable to go below 
~ 30 galaxies per cluster (blue crosses), which is the average 
number of galaxies that fit 3D-MF's radial profile within a 
IMpc window. Reducing the cutoff radius, r co , in Equation 
would improve the association of correct galaxies to 
low mass clusters but this would significantly reduce the 
high rate of success shown for higher mass cluster detections. 

The introduction of redshift errors moves the points 
in Figure lllfa leftward, meaning that as cluster members 
become scattered in redshift space, 3D-MF associates less 
of them to their parent cluster, as would be expected. In- 
terestingly, for the lowest mass clusters, which are the most 
difficult to detect, this means galaxy number counts are 
less likely to be overestimated while remaining detectable 
with 3D-MF. 

Examining the number of galaxies per cluster, as de- 
termined by 3D-MF, as a potential proxy for mass is shown 
in Figure [T^l Again focusing on the clusters with masses 
> 1O 14 M we see a weak redshift dependent trend: the 
most massive clusters have the most numerous members. 
For the low mass clusters however, owing to the incorrect 
assignment of too many galaxies to those clusters, we see 
no correlation. 

We will investigate not having a clear proxy for mass 
via 3D-MF in future work. P96, D07 and 01-4 assume 
that all light after background subtraction follows the 
Schechter function and that this can be expressed as a 
luminosity equivalent to the number of L* galaxies (which 
they refer to as A c ;), with good results, but accurate galaxy 
number counts per cluster is less of an issue with the lower 
cluster densities and higher mass clusters their methods 
typically find. We plan to further investigate our currently 
overestimated cluster galaxy members in the low mass 
ranges, by looking at galaxy number counts per cluster, 
and potentially optimising a variable cutoff radius in the 
3D-MF search window. 

Methods complimentary to 3D-MF would work as mass 
estimators as well; weak gravitational lensing techniques 
for example c an be utilised on real data to measure cluster 
masses (as in iHoekstral feoOT). Alternatively, spectroscopic 
redshifts could be utilised to precisely identify galaxy 
cluster members and potentially lead to determining a 
better contamination fraction in the current relationship 
between 3D-MF galaxy cluster member number counts and 
cluster mass. X- ray information is known to be an excellent 
proxy for mass l|Smith et al.l l2005). and Sunyaev-Zel'dovich 
(SZ) information could be added to tell us mor e about the 
struct ure o f clusters along the l ine-of-sight (as in lMotl et al.l 
(|2005h and lSealfon et all (|2006l ) for example). 
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Figure 11. The number of galaxies found per known cluster as a function of cluster mass (the dashed horizontal line denotes the 
definition in Tabic [T] which requires clusters have ^ 5 galaxy members). Figure ITTh presents exact redshifts while Figure [Tib includes 
redshift errors in the data. 
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Figure 12. The range of cluster masses and the number of galax- 
ies associated to those cluster masses by 3D-MF. 

4-3.2 Cluster Catalogue Purity 

As previously discussed, it is not an easy task to define 
the centre position of a cluster in an automated way. 
By construction, 3D-MF does not have a cluster centre 
predefined. Other cluster finding methods, such as those 
mentioned in Section [T] use the position o f the brightest 
cluster galaxy member dKoester et al.| [2007), or the centre 
of the distribution of red galaxies for example. We choose 
the centre of a galaxy cluster to be based on the luminosity 
and radial distribution of cluster galaxies. As shown in 
Figure ITU for clusters with M 20 o > 5 x 10 13 M Q , 3D-MF 
correctly associates galaxy members to their parent cluster, 
and we thereby expect the cluster centroid to be determined 
by this method with reasonable accuracy. For lower mass 
clusters however, owing to interloping galaxy members from 



nearby clusters (interlopers that fall within a radial profile 
centred near the lower mass cluster), the centroid that we 
measure is rather difficult to interpret. 

Recall that a 3D-MF cluster detection within 0.75Mpc of 
another simulation matched 3D-MF cluster detection (a 
3D-MF detection that was chosen to be the match as it 
was closer to the known cluster centre) is considered a 
secondary detection. This further complicates the issue of 
centroiding, as it is not clear whether the 3D-MF centroid 
for clusters with secondary detections should be calculated 
from the galaxies associated to both the primary and 
secondary detection or just the primary, especially consid- 
ering the fact that clusters often have substructures (and 
perhaps in reality don't have a clear centre). In the analysis 
that follows we choose the primary detection as the centroid. 

Figure [T3] shows the number of 3D-MF derived clus- 
ters matched to known simulation clusters as a function of 
the distance between their centres. For all mass ranges we 
find there are fewer 3D-MF cluster centre positions with 
increased radius from known cluster centres, showing that a 
3D-MF centroid measurement is frequently quite accurate. 
The elevated plateau at a matching radius > 1 arcminute, 
seen in the 1.5 x 1O 13 M ^ h'hoo < 5.0 x 10 13 M Q mass 
range, indicates some level of contamination in our low 
mass matched sample. 

To investigate this, we randomised the cluster centroids 
found by 3D-MF in RA and Dec and repeated our match- 
ing to known simulation cluster centre positions. The 
average random matches for all cluster mass ranges, from 
50 randomised trials, is presented in Figure Q3] as the 
dotted line, which we call matching noise. Over 80% of the 
contribution to this matching noise comes from low mass 
clusters (1.5 x 10 13 M Q ^ M 20 o < 5.0 x lO 13 Af ) and is 
thus a major source of the contamination in the low mass 
sample's plateau (red, triangle line) in Figure 1131 This is 
what we would expect: low mass (and thus small) clusters 
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Figure 13. Distance of 3D-MF derived cluster centres from pre- 
determined, known simulation centres (as denned in Section l2.2l l. 
The dotted line shows results from 3D-MF derived clusters ran- 
domised in RA and Dec before matching to known simulation 
clusters according to our matching criteria: over 80% of the con- 
tribution here is from the lowest (i.e. 1.5 X 1O 13 M0 ^ M200 < 
5.0 X 1O 13 M ) mass range (see text for further discussion). 



contribute to the matching noise increasingly at larger radii 
from known KW07 centres since randomising their 3D-MF 
found positions should uniformly place their detections 
across the field, and with any size of matching window 
there will always be some of these low mass clusters (of 
high number density) matched to a known cluster. 

The matching noise we have shown is an overestimate 
of the real matches due to noise because randomising 
the 3D-MF cluster positions in RA and Dec eliminates 
properly identified multiple detections of single clusters, 
thereby overestimating single detections and maximally 
increasing the matching due to chance incorrect matches. 
For example, a cluster that is detected by 3D-MF in one 
original redshift slice and also in the overlapping redshift 
slice would be considered a multiply detected cluster. After 
randomisation of the cluster positions, this would now 
appear as two separate cluster detections in the original 
and shifted redshift slices. To correct for this, we use our 
multiple detection rates of 36.2% and conservatively report 
a maximal upper limit on our false detections of 24.3% 
(i.e. [(Total number of false detections + (1 — 0.362) x 
Total number counts due to an upper limit of matching 
noise) /Total 3D-MF detections]). As discussed at the 
beginning of this section, we believe the false detection rate 
(especially for clusters of masses > 5.0 x 10 13 A/ Q ) is closer 
to 15.6%, but for the higher number densities of lower mass 
clusters this could be up to 24.3%. 

Due to the increased sensitivity of 3D-MF over two- 
dimensional matched-filter methods, we believe 3D-MF 
is correctly recognising the significance of low mass clus- 
ters above background noise, but due to excess galaxy 
number counts per cluster, and oversized fiducial cluster 
filters, has trouble correctly choosing a cluster centroid 
for these low mass clusters. We can safely confirm from 
this centroid analysis that we have chosen a physically 
motivated and sensible matching radius within which to 



properly match 3D-MF's detections with known simulation 
cl usters. Interestingly, the tw o-dimensional matched-filter 
of IWhite fc Kochanekl {2002 ) successfully implements an 
iterative centroiding technique: a logical next step for 
improving the current implementation of 3D-MF and its 
ability to determine cluster centres. 



5 CANADA-FRANCE-HAWAII TELESCOPE 
LEGACY SURVEY DEEP FIELD CLUSTER 
CATALOGUES GENERATED BY 3D-MF 

5.1 Canada- France-Hawaii Telescope Legacy 

Survey Deep Data and Catalogue Generation 

For the current work we use the four lxl square 
degree Canada-France-Hawaii Telescope Legacy Survey 
(CFHTLS) Deep fields (Dl,2,3,4) and COSMOS data which 
were presented within the CFH TLS- Archive- Researc h Sur- 
vey; we refer the rea der to lErben et all (2009) and 
Iffildebrandt et al.l (2009) for details on the data processing 
and multi-colour catalogue creation. 

Our five-band multi-colour and photometric redshift 
catalogues are based on CFHTLS i' — band object detec- 
tions. The seeing for this band varies between 0.71 and 0.82 
arcseconds within the four fields and we reach a la limiting 
magnitude of lAB.iim ~ 29.5 within a diameter of twice the 
seeing disk. Photometric redshifts were est i mated with the 
publicly availa ble code BPZ (see Bem'tezl (|2000| ) and the 
prescription in iHildebrandt et all (I2009T )). In the fields Dl, 
D2 and D3 we quantified the accuracy of our photometric 
redshifts with s pectro scopic redshift s from th e VVD S 
(|Le Fevre et al.l j2005l)). zCOS MOS (|Lillv et all l|2007l )). 
and DEEP2 l|Davis et all l|2007h ) respectively. Within the 
magnitude range of 17 <Y-band < 24 we estimate unbiased 
photometric redshifts, and after rejecting problematic 
sources (we cut with t he BPZ ODDS parameter fo r sources 
with 0DDS> 0.9: see also Iffildebrandt et alj (|2009l )). we find 
a photometric - spectroscopic redshift scatter of a ~ 0.033 



of the quantity Az = (z p hot 



c )/(l + Zspcc). Our outlier 



rate with \Az\ > 0.15 is 1.6%. Our final Deep catalogues 
contain ~ 1.23 x 10 5 galaxies per square degree with 
i'— band magnitudes < 25.5 (compared to the simulations 
with z < 1.2 and the same magnitude limit, which had 
~ 1.59 x 10 galaxies per square degree). 



5.2 CFHTLS Deep Galaxy Cluster Catalogues 
Courtesy of 3D-MF 

3D-MF was set up according to the parameters in 
Table [2] but an appropriate M* and a were de- 
rived from the CFHTLS Deep fields and found to be 
M *'-band = 22.46 ± 0.15 and a = -1.005 ± 0.021. 3D-MF 
was then run on the CFHTLS Deep fields. Table [4] contains 
8 random entries (2 per field) from our CFHTLS Deep 
galaxy cluster catalogue; the entire catalogue is available 
upon request to the authors. From our Millennium Sim- 
ulation tests we would expect to see ~ 16% — 24% false 
positives in this catalogue, distributed mostly in the lower 
mass ranges according to the selection functions in Section 
13] Using our multiple detection criteria, we found 37.6% of 
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Table 4. 3D-MF Derived CFHTLS Deep Galaxy Cluster Catalogue. 



CFHTLS Deep Galaxy Clusters 


Deep 


RA 


Dec 


Redshift 


Significance 


Grouping 


Identification Number 


Field 










Same Redshift 


Overlapping Redshift 


Dl 


02:25:07.536 


-04:01:47.892 


0.2 


7.16 






Dl 


02:24:26.040 


-04:52:29.676 


0.9 


5.21 


2 


58 


D2 


09:58:54.720 


01:54:57.744 


0.4 


5.35 






D2 


09:58:53.759 


02:14:16.188 


0.5 


5.15 




31 


D3 


14:17:22.802 


52:54:52.920 


0.2 


7.27 






D3 


14:18:19.440 


53:05:58.200 


0.5 


5.61 






Dl 


22:14:39.120 


-18:09:41.760 


0.6 


5.23 




37 


D4 


22:16:08.400 


-18:11:48.840 


0.4 


6.73 







...complete catalogue available upon request... 



Deep detections were duplicate detections of clusters (com- 
parable to the ~ 36% multiple detection rate found from 
our Millennium Simulation tests). Grouping Identification 
Numbers in Table [4] are numbered flags: cluster detections 
within 0.75Mpc of each other (recall Section 14.1.21 for 
details) in the same redshift slice are flagged with identical 
Same Redshift numbers, and cluster detections within a 
projected 0.75Mpc of each other in overlapping redshift 
slices are separately flagged with identical Overlapping 
Redshift grouping identification numbers (which can be 
propagated through more than one overlapping redshift 
slice if still within 0.75Mpc in projected radii of each other 
in overlapping slices). Note that Grouping Identification 
Numbers in both the Same Redshift and Overlapping 
Redshift numbering restart at zero for each Deep field. 

We use the significance of our detections to select the 
best galaxy cluster candidate from among multiple detec- 
tions, and excise the remaining multiple detections from the 
following discussion and analysis. The redshift distribution 
of 3D-MF found Deep galaxy clusters is shown in Figure 
1141 A comparison to other published CFHTLS Deep cluster 
lists via older matched-filter methodology ensues. 

5.3 Comparison to Other Published CFHTLS 
Deep Clusters Found Using Matched-Filter 
Methods 

lOlsen et al.l (|2007l ) (herein 03) published a matched-filter 
derived cluster list of the CFHTLS Deep fields. A subsequent 
paper in 2009 (jGrove et al.l (|2009l ): 04) examined running 
their matched-filter method (which does not utilise photo- 
metric redshifts) separately on different wavelength bands 
and then merging lists with the 2007 paper, a technique sim- 
ilar to P96's efforts to reduce false detection rates. Since the 
resulting lists are not too different, we choose to compare the 
i'— band 2007 03 cluster list to a cluster list found by 3D- 
MF, using the same i'— band data, in this paper. We focus 
on the 0.2 sj z ^ 1.0 range for clusters, keeping in line with 
our acceptable photometric redshift region for galaxies and 
our slicing methodology. The redshift distributions of the 03 
cluster list, after their 3.5cr cut, for all four CFHTLS Deep 
fields is shown overlayed on the 3D-MF results in Figure [Til 
Drastically fewer galaxy clusters were found by 03 at all red- 
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Figure 14. Redshift dist ribution of all 3D -MF found (dashed 
up-sloping histogram) and lOlsen et al. l lj2007T ) found (shaded his- 
togram) galaxy clusters in CFHTLS Deep fields 1 through 4. 

shifts; 3D-MF finds ~ 170 galaxy clusters per square degree 
(well within reason compared to the Millennium Simulation 
analysis cluster number densities in Section [4.3[1 compared 
to ~ 40 galaxy clusters per square degree in 03. 

In order to match the clusters found using both methods, a 
two-dimensional tolerance radius of 0.044 degrees (akin to 
the matching strategy in Section I4.1.1|l was placed around 
each 03 cluster centre position and the closest 3D-MF de- 
tection in RA and Dec was considered a match. Figure 1151 
shows the redshift distribution, and Figure [TBI presents the 
redshift differences, of the two-dimensional matches between 
the two cluster lists. 

As mentioned in Section l3.3.5l older matched-filter methods 
tend to wrongly estimate the redshift of clusters. 03 
searches the entire input catalogue (i.e. no redshift slicing) 
for a match to a fiducial cluster sized to match what would 
be expected at a particular redshift, and then repeats the 
process with a slightly re-sized fiducial cluster size (to 
match what would be expected for a cluster that existed 
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Figure 15. The redshift distribution of only those galaxy clusters 
found using both 3D-MF and 03 methods, matched to each other 
in two-dimensional space (see text for details). 3D-MF results are 
represented by the dashed up-sloping histogram, and 03 results 
are the shaded histogram. 
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Figure 17. Redshift distribution of galaxy clusters found using 
both 3D-MF and 03 methods, matched to each other in three- 
dimensional space (see text for details). The vertical scale of Fig- 
ure \TE\ is duplicated here for comparison purposes. 
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Figure 16. Differences in assigned galaxy cluster redshifts for 
clusters found using both 3D-MF and 03 methods (using two- 
dimensional matching as described in Section l5.3[ l 



at a slightly different redshift); 03 requires each cluster to 
be detected in two neighbouring fiducial cluster re-sizing 
searches in order to be considered further. This affects 
higher redshifts disproportionately, as they are less likely 
to be found in equally-stepped fiducial cluster-resizing 
bins than lower redshift clusters; higher redshift space of 
equivalent bin widths covers much more volume. Figure 
1151 shows the likely result of this: the 03 cluster sample is 
skewed toward lower redshifts. 

A second separate matching was performed between 
the 3D-MF Deep cluster list and 03; this time a three- 
dimensional tolerance range was considered around each 03 
cluster. There were less matches, as seen in Figure ITTl again 



Table 5. A Comparison of CFHTLS Deep galaxy clusters found 
with 3D-MF and pub lished CFHTLS Deep galaxy clusters from 
03 iOlsen et alj|2007l) . Cluster redshift ranges of 0.2 ^ z ^ 1.0 
are considered, keeping in line with this work, and galaxy clusters 
present in both catalogues with similar redshifts (as explained in 
the text) are reported again in the '03 & 3D-MF' column. 



3D Cluster Comparison 


CFHTLS 


3D-MF 


03 


03 


Deep Field 






& 3D-MF 


Dl 


161 


43 


32 


D2 


162 


38 


31 


D3 


190 


38 


25 


Dl 


160 


30 


23 


Total Dl-4 


673 


149 


111 



skewed toward the lower redshift end of the distribution. 

Comparing both the 3D-MF and 03 CFHTLS Deep cluster 
lists, there were 145 clusters found by both 3D-MF and 03 
out of 149 total 03 clusters (97%) with the two-dimensional 
matching criteria described. 528 additional galaxy clusters 
were found with 3D-MF. As mentioned, the matching 
process between 3D-MF clusters and 03 clusters was 
repeated considering a further third dimension in redshift 
space (see Table [5|; in this case there were 111 clusters 
found by both 3D-MF and 03, suggesting 34 of the 03 
clusters (23%) were not assigned the correct redshift by 
their algorithm. 

The completeness has been shown to be vastly differ- 
ent between the two methods: 3D-MF is 100% complete 
down to ~ 3 x 10 14 M Q , ~90% complete down to 1 x 1O 14 M 
and the false detections are likely concentrated even further 
down the cluster mass function at the low mass end. It was 
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discussed in general in Section T3. 2 1 that older matched-filter 
methods have false detection rates of ~ 30 — 65%, suffer from 
line-of-sight projection issues, wrongly estimate redshifts, 
and have not been tested on Millennium Simulation-like, 
realistic, extensive simulations. 



6 CONCLUSIONS 

3D-Matched-Filter (3D-MF) is an optimised, automated 
galaxy cluster finder, ideal for large optical astronomical 
datasets with multi-wavelength band coverage. It utilises a 
fiducial galaxy cluster's radial and luminosity profiles and 
searches the data for matches to this over a wide range of 
redshifts. Future surveys, such as LSST and JDEM, can ex- 
ploit 3D-MF's automated methodology and statistical ap- 
proach to produce complete and reliable galaxy cluster cat- 
alogues. 3D-MF is an improvement over other matched-filter 
techniques due to several improvements, including: 

• the cutting of input data into overlapping redshift slices 
to examine it piecewise, significantly reducing line-of-sight 
projections; 

• an implementation of mask-handling capabilities, im- 
proving edge-effects, and cluster detections near bright stars 
or saturated pixels; 

• an accurate modelling of data-dependent background 
galaxy counts; 

• the development of a new peak detection pipeline; 

as well as a list of other parameters that can be 
fine-tuned according to the dataset being examined. 

The Millennium Simulation mock catalogue lightcones 
were used to extensively test and improve 3D-MF, and se- 
lection functions for the algorithm were presented. Redshift 
errors mimicking real data were modelled and added to the 
simulations and their effect on the selection functions was 
derived. With redshift errors, and focusing on the cluster 
range 0.2 sC z sC 1.0, 3D-MF was found to recover 100% of 
known galaxy clusters with an M200 ^ 3.0 x 1O 14 M ; 97% 
of clusters with an M200 > 2.5 x 10 14 M Q ; 88% of clusters 
with an M 20 o ^ 1.0 x 10 14 M o ; and 72% of clusters with 
an M200 > 5.0 x 10 13 M Q . 36% of detections were multiple 
detections of clusters. False detections were determined to 
be occuring at a rate of 15.6% of the total cluster detections, 
and a subsequent analysis showed this to be concentrated in 
the low significance, low galaxy number (< 50) per cluster, 
and likely lower mass (M200 < 1 x 1O 14 M0) range (poten- 
tially increased by noise up to a conservatively reported 
rate of 24%). After selection functions were quantified (and 
the effects of adding redshift errors to the catalogues were 
analysed), 3D-MF was run on the four CFHTLS Deep 
fields. 3D-MF finds ~ 170 galaxy clusters per square 
degree in the Deep dataset: over 400% more, with a much 
lower false detection rate, and higher accuracy of redshift 
determination for true clusters, than found by other authors 
using two-dimensional matched-filter methods on the same 
£'— band data. 

For future work, there are subtle adjustments to 3D- 
MF that we are examining. A non-passively evolving 
^-correction could be applied, taking into consideration the 



effect of variation s in galaxy types. Mo re interestingly per- 
haps, as shown in lPopesso et al. (2005), galaxy clusters are 
often better fit with two Schechter functions as opposed to 
one: implementing this result may further improve 3D-MF. 
The radial profile used in 3D-MF's radial filter could also be 
tuned to try to find more low mass, smaller clusters (if pos- 
sible). Different filter bands could be used in 3D-MF when 
analysing the redshift slices, and results compared with the 
i! — band used herein, or spectroscopic redshifts could be 
utilised to confirm cluster detections and precisely identify 
galaxy cluster members. It would be sensible to try to ex- 
tend the high redshift range of the detection capabilities of 
3D-MF as well, although this method is ultimately limited 
by the build up of the luminosity function at redshifts > 1.2. 

The Millennium Simulation has proved invaluable in 
refining and understanding 3D-MF as an automated optical 
cluster finder. A few points were raised that we continue to 
study in fastidious detail: finding a 3D-MF derived proxy 
for cluster mass, refining the number of galaxies associated 
to a cluster, and improving cluster centroid determinations 
are examples. In the future we will introduce a weak 
gravitational lensing analysis to elucidate cluster masses 
and we are also investigating multi-wavelength coverage in 
X-ray and SZ data to refine cluster information. 
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